On Continuous-Action Q-Learning via Tile Coding Function Approximation

Author

Alexander A. Sherstov

Abstract

Reinforcement learning (RL) is a powerful machine-learning methodology that has an established theoretical foundation and has proven effective in a variety of small, simulated domains. There has been considerable work on applying RL, a method originally conceived for discrete state-action spaces, to problems with continuous states. The extension of RL to allow continuous actions, on the other hand, has seen relatively little research. One proposed approach to allowing continuous actions is to represent the value function using a tile-coding function approximator. We introduce a simulated domain for the controlled study of this method in conjunction with Q-learning and report empirical results on its performance under different parameterizations. Our experimental findings contribute a deeper understanding of the workings of tile coding in continuous-action domains, provide guidance to parameter choices, and point out an improvement on this method which we verify empirically.
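As a rough illustration of the approach the abstract describes, the following is a minimal sketch of Q-learning with a tile-coded value function over a joint (state, action) input. It is not the paper's implementation. The names `TileCoder`, `greedy_action`, and `q_learning_step` are hypothetical, the state and action are each assumed to be a single number scaled to [0, 1], and the greedy action is found by scanning a fixed grid of candidate actions, which is only one of several possible choices.

```python
class TileCoder:
    """Several offset grid tilings over the joint (state, action) unit square."""

    def __init__(self, num_tilings=8, tiles_per_dim=8):
        self.num_tilings = num_tilings
        self.tiles_per_dim = tiles_per_dim
        # One extra tile per dimension absorbs the shift of the offset tilings.
        self.side = tiles_per_dim + 1
        self.weights = [0.0] * (num_tilings * self.side * self.side)

    def active_tiles(self, state, action):
        """Return one weight index per tiling for inputs in [0, 1]."""
        indices = []
        for t in range(self.num_tilings):
            offset = t / self.num_tilings  # shift, in units of one tile width
            row = int(state * self.tiles_per_dim + offset)
            col = int(action * self.tiles_per_dim + offset)
            indices.append((t * self.side + row) * self.side + col)
        return indices

    def value(self, state, action):
        """Q(state, action) is the sum of the weights of the active tiles."""
        return sum(self.weights[i] for i in self.active_tiles(state, action))

    def update(self, state, action, target, alpha):
        """Move Q(state, action) toward target; the step is split across tilings."""
        error = target - self.value(state, action)
        step = alpha / self.num_tilings
        for i in self.active_tiles(state, action):
            self.weights[i] += step * error


def greedy_action(q, state, num_candidates=33):
    """Approximate argmax over actions by scanning evenly spaced candidates."""
    candidates = [k / (num_candidates - 1) for k in range(num_candidates)]
    return max(candidates, key=lambda a: q.value(state, a))


def q_learning_step(q, state, action, reward, next_state,
                    alpha=0.1, gamma=0.9, terminal=False):
    """One Q-learning backup: target = reward + gamma * max_a Q(next_state, a)."""
    if terminal:
        best_next = 0.0
    else:
        best_next = q.value(next_state, greedy_action(q, next_state))
    q.update(state, action, reward + gamma * best_next, alpha)
```

Because neighbouring actions share some of their active tiles, an update at one action also shifts the estimates of nearby actions, which is what lets a finite weight table stand in for a continuous action range. The number of tilings and the tile width control how far that generalization reaches.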

Similar resources

Tile Coding Based on Hyperplane Tiles

In large and continuous state-action spaces reinforcement learning heavily relies on function approximation techniques. Tile coding is a well-known function approximator that has been successfully applied to many reinforcement learning tasks. In this paper we introduce the hyperplane tile coding, in which the usual tiles are replaced by parameterized hyperplanes that approximate the action-valu...

Function Approximation via Tile Coding: Automating Parameter Choice

Reinforcement learning (RL) is a powerful abstraction of sequential decision making that has an established theoretical foundation and has proven effective in a variety of small, simulated domains. The success of RL on real-world problems with large, often continuous state and action spaces hinges on effective function approximation. Of the many function approximation schemes proposed, tile codi...

Reinforcement Learning applied to Keepaway, a RoboCup-Soccer Subtask

This Bachelor Final Project aims to be a demonstration of the power and usefulness of reinforcement learning, especially for RoboCup-Soccer. In the first part, the general theories behind reinforcement learning are described. Different kinds of basic solving methods are compared: value iteration, policy iteration, Monte Carlo methods, Sarsa and Q-Learning. Eligibility traces are added to the basic m...

A Q-learning Based Continuous Tuning of Fuzzy Wall Tracking

A simple, easy-to-implement algorithm is proposed to address the wall-tracking task of an autonomous robot. The robot should navigate in unknown environments, find the nearest wall, and track it based solely on locally sensed data. The proposed method couples fuzzy logic and Q-learning to meet the requirements of autonomous navigation. Fuzzy if-then rules provide a reliable decision maki...

Adaptive Tile Coding for Value Function Approximation

Reinforcement learning problems are commonly tackled by estimating the optimal value function. In many real-world problems, learning this value function requires a function approximator, which maps states to values via a parameterized function. In practice, the success of function approximators depends on the ability of the human designer to select an appropriate representation for the value fu...

Publication date: 2004
